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1 Introduction 

Many code packages have been created to perform rapid Monte Carlo integration of the phase 
space for collider physics processes, to simulate the effects of hadronization, and to emulate 
the response of detectors such as ATLAS and CMS. The most commonly used event gen¬ 
erator is arguably MadGraph5+aMC@NLO [1], with Pythia [2], Herwig-|-+ [3], Sherpa [4] 
and WHiZaRD [5] also commonly used for event generation. To simulate the detector re¬ 
sponse, including efficiency and isolation details, PCS [6] and Delphes [7] are the two most 
commonly used packages for theorists/phenomenologists, with a modihed version of Delphes 
being employed by the CheckMATE [9] code package, while full detector simulators are avail¬ 
able only to experimental groups. The collection and analysis of the events passing through 
the generation-through-detector-simulation process is often left to the individual, with some 
packages, such as Mad Analysis [8], existing to assist in the reproduction of experimental 
analyses, with varying learning curves. 

These tools are important for phenomenologists for two purposes ~ proposing new analy¬ 
ses that will be sensitive to a well motivated model or signature, and reproducing experimental 
analyses for a particularly motivated model to determine exclusions of parameter space. One 
of the challenges of performing these studies is in producing a sufficiently large number of 
events, especially of standard model (SM) backgrounds, to achieve an appropriate coverage 
of the full phase space where the analysis focuses. A good rule-of-thumb in generating back¬ 
grounds to reproduce an experimental analysis is to generate a sufficiently large number of 
events that the per-event weight is equal to the inverse of the integrated luminosity. Thus, 
any process with a nanobarn cross section at LHC8 with 20/fb integrated luminosity, would 
require 20 million events to be generated. Techniques can be employed at the event gen¬ 
eration stage to focus on the region of phase space of the SM backgrounds which overlap 
with the signal of interest, however this limits the reusability of these backgrounds for scans 
over parameter space that result in varying kinematic signatures. As a result, many analyses 
would require a large number of events to be generated. 

Common output file formats from the generation-hadronization-simulation process, in¬ 
cluding the Les Houches file format [10] and the ROOT hie format [11], produce large hie 
sizes that can be prohibitive to store for researchers without signihcant computing resources. 
Alternatively, the LHC Olympics (LHCO) hie format is a minimalist output hie format that 
can be produced by both PCS and Delphes detector simulation packages, and has the added 
bonus of shielding the user from all truth information regarding the events, thus eliminating 
the possibility of user bias in an analysis. This hie format allows a user to store a large 
number of events with relatively little disk space use. 
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This article introduces a new analysis package, entitled Seer, for the rapid analysis of 
an arbitrary number of LHCO files (limited by computer memory only, tested with over 850 
files of 50k events each, simultaneously).^ The goal of this package is to provide a simple 
way to combine multiple event generation files for many different processes, apply trigger 
requirements and kinematic cuts, and analyze and histogram the results. Many common 
kinematic variables are already implemented, including the popular stransverse mass {Mt 2 ) 
[15-17] variable and the Razor variables^ [13, 14] used to analyze SUSY decays, allowing 
usefulness out-of-the-box. Since the package employs ROOT libraries, modification of the 
code to add user defined kinematic calculations and cuts is simple and easy for new users. 

The installation of Seer is covered in Section 2. Following this, an in-depth discussion 
regarding the use of Seer is covered in Section 3, focusing on the “out-of-the-box” version. In¬ 
structions for modifying Seer to creatie user-defined cuts and kinematic variables is discussed 
in Section 4. Lastly, a series of examples are provided showing very basic analyses of Seer, 
and examples of the final output histograms. 

Of note, a variety of font face formats have been adopted to differentiate between refer¬ 
ences to files, references to text settings, and references to parts of the Seer code. As a brief 
legend, hies use a helvetica font, text settings use a true-type font, and references to code 
use a small true-type font. 

2 Installing Seer 

Installing Seer is straightforward. Seer has been tested on OS X 10.7-1- and Linux distributions 
using the GNU C-|—|- compilers. Future versions may address other compiler options. 

2.1 Pre-requisites 

Seer requires ROOT libraries, as much of the functionality uses classes dehned within ROOT. 
As a result, ROOT needs to be installed. Instructions regarding the installation of ROOT 
can be found at https: //root. corn, ch/drupal/content/downloading-root. Seer requires 
that the $R00TSYS system variable be correctly setup in order to function. 

2.2 Co-requisites 

Seer includes a copy of the MT2/stransverse mass calculator developed by Z. Han and H.C. 
Cheng [15-17]. To implement alternative versions of this code, the int2_bisect.cpp and 
mt2_bisect.h hies should be replaced in the Seer main directory. 

Any extra C-|—|- code a user wishes to add can be placed in the Seer directory and added 
to the makefile. The name of the header hie needs to be added to the $HEADERS variable in 

similar effort has been made with the CutLHCO program [12]. In addition, MadAnalysis can perform 
many of the same functions, though Seer offers some differences in both the input and output, and provides 
some different features. 

^Razor calculations use the script provided by CMS at https://twiki.cern.ch/twiki/bin/view/ 
CMSPublic/RazorLikelihoodHowTo 
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the makefile, and the object file to create needs to be added to the $0BJECTS variable, in 
addition to adding the appropriate include statement in the Seer code. 

2.3 Unpacking 

Seer must first be unpacked. This can be performed by navigating to the directory containing 
the tarball and running the command tar -zxvf seer. tar.gz. This will unpack all the files and 
directories needed by Seer. The following files are mandatory for Seer to compile and run: 

• seer.h & seer.cpp - header and C++ file for the main script. 

• eventweights.h & eventweights.cpp - header and C++ file for the script that pre-scans 
the LHCO hies for cross section and number of events. 

• calculations.h & calculations.cpp - header and C++ hie for the cuts/trigger/analysis 
class event. 

• plots.h & plots.Cpp - header and C++ hie for the plotting class plot. 

• Stylo. h - style hie containing a number of settings for the histograms. 

• mt2_bisect.h & mt2_bisect.cpp - header and C++ hie for the MT2 calculator. 

• seer_cuts.txt - contains all the cut and trigger settings 

• seer_fakes.txt - contains all the fake rate settings 

• seer_files.txt - contains the list of LHCO hies 

• Seer_plotS.txt - contains the plot settings 

• seer_settings.txt - contains all settings for the constants that can be used in the calcu¬ 
lations 

2.4 Compiling 

Seer is compiled by navigating to the directory in a terminal window and entering the make 
command. 

3 Using Seer 

Seer is run via a single command in a terminal, ./soer.exe. All details of the run are deter¬ 
mined by the settings in hve text hies: three primary hies (seerJiles.txt, seer_cuts.txt and 
seer_plotS.txt) and two secondary hies (seer_fakes.txt, and seer_settings.txt). The general 
process followed by Seer is given in Figure 1. A brief summary of the hies is included below, 
and detailed information is provided in individual sections for each hie. 
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list of all LHCO files 
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seer_fakes.txt 

settings for fakes 
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seer_plots.txt 

settings for plotting 
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seer.cpp 

Runs all scripts 


eventweights.cpp 

Cross section and 
event counts of all files 

add_plot() 

Creates groups of 
histograms \A/ith the same 
tag identifier, used to 
combine files together 

• s- ^ 

/-■ N 

add_title() 

Stores file location, 
tag, cross section, 
number of events, and 
indicator if file is part 
of a larger set of files 

s._ ^ 

^ N 

determine_weights() 

Scans through all event 
files, double checks 
event counts and cross 
section values 

V_y 


calculations.cpp 

Performs all calculations 


J 


set_event() 
add_particle() 

Add all particle entries 
to appropriate lists, 
adds extra smearing 
where settings dictate 

runJgnoreO 

Removes particles that 
meet specific criteria 

\s_^ 

run_trigger() 

Checks to see if event 
passes trigger requirements 

V,_ y 

N 

run_tag() 

Check to see if event 
meets the tagging criteria 

V_ y 


run_cuts() 

Performs boolean check 
to see if event kinematics 
pass the cut criteria 


calc_ps() 

Determines the value 
of the kinematic variable 
for plotting 


plots.cpp 

Combines histograms 


constructor 

Creates histogram 
canvas, pad and axes, 
sets up naming/labels 

set_preanible() 

Adds text to print out 
to accompanying txtfile 
{print out of all settings 
used in Seer, and all 
bin values in histograms) 

generate_plot() 

Reorders plots based on 
bkg/signal status and 
total cross section, 
creates cumulative sum 
of background bins 
for plotting 


Figure 1. Flow chart of Seer, describing where input settings are used, and the main flow of processes, 
and the files that contain each of the main functions. 


seer files.txt 

This is the first file that should be adjusted when starting an analysis. It contains the 
list of all LHCO files in the analysis, as well as details on their groupings. More information 
regarding this file is provided in Section 3.2. 

seer_cuts.txt 

This is the second file that should be adjusted. It contains the details of the tagging, the 
experimental trigger thresholds, fine tuning details of the detector limits, and details of the 
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cuts. More information regarding this file is provided in Section 3.3. 


seer_plots.txt 

This is the last primary file to adjust. It contains the settings required to determine what 
type of histogram to make, as well as the details of the histogram and aspects of the output 
file names. More information regarding this file is provided in Section 3.4. 

seer fakes.txt 

This is a secondary file, as the settings within are useful only for advanced users. Seer can 
be used to handle fakes instead of the detector simulator used. More information regarding 
this file is provided in Section 3.5. 

seer_settings.txt 

Some additional settings are contained in this file, including numerical constants used 
by Seer to add additional smearing to jet and missing transverse energy kinematics. More 
information regarding this file is provided in Section 3.6. 


3.1 Seer Output 

The output of Seer is two files - one plot hie and one text hie. The naming system for both 
of these hies is identical except for the extension, and follows the scheme: 

(plot_type)_(name_string)_n######.(file_type) 

where (plot.type) is hxed in plots.cpp and depends only on the (plotnum) setting in 
seer_plotS.txt, (name_string) is the setting in seer_plotS.txt that allows users to add a unique 
or descriptive text identiher to a particular analysis, and the sequence of numbers is the total 
sum of events of all hies included in seerJiles.txt. The output hie type for the histogram 
depends on the (file_type) setting in seerJiles.txt, while the text output hie is in a .txt 
format. More information regarding these settings is given in Section 3.4. Note that running 
Seer multiple types with the same (plotnum) and same hies in seerJiles.txt will result in 
overwriting previous hies. Users need to use the (name_string) held to differentiate between 
applying different cuts or other analysis details that do not change the hie name. 

The plot output hie contains a hgure of the histograms of the various groupings of hies 
included in se6rJiles.txt. The text output hie contains a header with a copy of every line in 
the hve input settings hies. Following that, it provides the following information: 

1. Individual File Summaries: for each hie, a section starting with the text “Now run¬ 
ning event hie...” provides the hie path and tag in a descriptive sentence, then gives a 
summary of the results of the analysis. 

• File Tag - tag given in seerJiles.txt 
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• File Event Weight - the event weight of the individual file (ignoring multiple files 
with same tag) 

• Combined Event Weight - the event weight used when accounting for combined 
files (wf) 

• Untagged Events - the number of untagged events, and corresponding cross section 

(<7untagged — ^untagged ^ ^/) 

• Cut Events - the number of events that were tagged but were cut in the analysis, 
and corresponding cross section (fJcut = ''^cut x tcj) 

• Tagged & Uncut Events - the number of events that were tagged and passed the 
cuts of the analysis, and corresponding cross section (uinduded = f^inciuded x rc/) 

• Total Events - the sum of the Untagged, Uncut and Tagged &: Uncut events, and 
corresponding cross section. 

• Cross Section - the final, included cross section passed to the histogram, including 
associated theoretical error ((Juinciuded = <7inciuded/ -v/^^inciuded; if no events pass 
cuts/tagging, (5crinciuded = Wf is used) 

2. Group Summaries: includes all events from all files with the same tag in seerJiles.txt. 

• Group Tag - tag of the group of files, as defined in seerJiles.txt 

• Untagged Events - the number of untagged events, and corresponding cross section 

(*^untagged ^untagged X Wj) 

• Cut Events - the number of events that were tagged but were cut in the analysis, 
and corresponding cross section (ficut = ncut x tc/) 

• Tagged & Uncut Events - the number of events that were tagged and passed the 
cuts of the analysis, and corresponding cross section (uinduded = f^inciuded x rc/) 

• Total Events - the sum of the Untagged, Uncut and Tagged &: Uncut events, and 
corresponding cross section. 

• Cross Section - the final, included cross section passed to the histogram, including 
associated theoretical error (Jugroup = '^'^individuai) 

3. Overall Summary; a list of the results from all files combined. This is the same infor¬ 
mation as in the group summaries, except now it includes all groups combined. 

4. Breakdown of Cuts; a list of the cross section removed by each individual cut. Since 
multiple cuts can remove an event, these are not exclusive cross sections. 

5. Number checks: a weight sum and total number of events check to make sure everything 
adds up correctly 

6. Histogram results: a tab separated text list of the bin centre and cross section values 
for each histogram (in fb) 
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Users who are interested in the total cross section before and after cuts can read this infor¬ 
mation directly from the relevant group summary in the text file. Users who are interested 
in distributions have a choice between the visual histogram and the text of the histogram 
values. Of note, the backgrounds in the text file are not summative, unlike the background 
histograms in the plot file, as discussed in Section 3.4. 

The following sections describe in great detail the text files that Seer uses to read in all 
of the settings for the run. It is important to note that a line can be commented out by using 
double hashtags, 

3.2 seer_files.txt 

Seer checks this file for the list of all LHCO files to be used in the analysis. For each file/line, 
there are four required entries and two optional entries. The format of these entries is as 
follows: 

(tag) (d/s) (fake#) (file_location) (cross_section) (n_events) 

The (cross_section) and (n_events) entries are optional, meaning that they can be ex¬ 
cluded from file without causing problems with the operation of Seer. These values are used 
to scale the per-event weighting. By default, Seer will read these values from the LHCO 
file directly if they are not set, but a user may wish to rescale the cross section and can do 
so to override the automatic scan performed by Seer. Further details of the entries in the 
seer_files.txt file are provided in Table 1. 

To better explain the behaviour of Seer, it is useful to examine some example scenarios 
to clarify the interplay between the (tag) and (d/s) settings. 

Generated pp —)• W~^W~ and pp —>■ Z, listed with a common tag (e.g. 
“background”) and with the flag “d”. This results in a single histogram 
being generated combining the events from both files, and with atot = 
o'ww + o'z- 

Generated 10 files of pp —)• W~^W~, listed with a common tag and the flag 
“s”. This results in a single histogram with total cross section averaged over 
all files {atot = the per-event cross section weighted 

by the inverse of the combined sum of events from all files. Note that these 
files have to be generated with the exact same phase space limits, otherwise 
combining them will not be valid. 

Generated 10 files of pp —?• each with different, non-overlapping ranges 

of mil (such as 0-100, 100-200, 200-300, 300-400, etc...), listed with a com¬ 
mon tag and with the flag “d”. This results in a single histogram with the 
combined total cross section {atot = This method results in signifi¬ 

cantly better sensitivity over the full range of phase space as compared with 
10 files with identical phase space. 

Files with unique tags will always be treated independently, and be plotted with a their unique 


Scenario 1: 


Scenario 2: 


Scenario 3: 


Table 1. Explanation of the settings for the seer_files.txt file. 


Parameter 

Explanation 

(tag) 

This should be an alphanumeric (including symbols ()-+=,.7)^%_). 
Multiple sequential entries of identical tags will result in these files 
being grouped together, with cross sections being combined based on 
the status of the (d/s) flag. Note that the tag choice here will also be 
the text entry in the legend for that file. 

(d/s) 

This is a flag with two possible values: “d” or “s”. A choice of “d” 
indicates that this file has a different phase space or process from other 
files with the same tag, and the cross section should be summed. A 
choice of “s” indicates that this file duplicates the same phase space 
as the previous file in the list, and so the total cross section should be 
averaged between all files with an “s” and the same tag. 

(fake#) 

This is a flag for adding in a pseudo-fake rate, and is a number between 

0 and 9, inclusive. This is discussed further in Sec. 3.5. 

(file_location) 

This is the path (full or relative) to the LHCO file, including the file 
name. 

(cross_section) 

This is a cross section value (in pb) that rescales the total cross section 
for that individual file only. This can be used to rescale total cross 
sections as desired, or to combine multiple different processes with the 
same tag, as discussed below. 

(n.events) 

This is the number of events that corresponds to the cross section 
value. This entry mnst accompany the (cross_section) entry when 
it is present. Valnes can be either pre- or post- matching (e.g. the 
(cross_section) and (n_events) settings 5.31 50000 would be the 

same as 3.5955 33856). 


tag. Thus, if there is only one file for a given tag, the choice of the (d/s) flag is unimportant. 

There is currently no way to automatically combine multiple files of each of two separate 
processes with the same tag (i.e. Scenario 1 bnt with multiple files of each of the processes). 
This is why the cross section and event number overrides are implemented. In order to nse 
this correctly, the user must manually reduce the cross section to the per-event value following 
the formula: 


a 


/ 

i 


(Ji X m 
k 


(3.1) 


where ai is the cross section listed in the LHCO file. If all files have the same number of 
events, then this works ont to simply ai/k. 

Determining the per-event cross section is only slightly more complicated when users 
are employing matching and merging algorithms in the generation stage. Currently, the 
MG5+aMC@NLO plus Pythia package lists the total cross section before matching and the 
total events generated before matching/merging, as well as the events after matching/merging. 
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in the LHCO file. In this case, the relationship CThefore/nbefore = Rafter/nafter holds (where 
before/after refers to before/after merging/matching), and so the per-event weight can be 
determined with CTbefore/nbefore, which is calculated at the MG5+aMC@NLO stage. 
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3.3 seer_cuts.txt 


In the seer_CLltS.txt file, four different types of entries are possible: tagging, trigger, ig¬ 
nores and cuts. These will each be addressed separately below. The general format for the 
seer_cuts.txt file is given by: 

Signal ## Do not alter this line! 

##TAGGING 

(nlow) (nhigh) (type), pt (ptlow) (pthigh), eta (etalow) (etahigh) 

Extra ## Do not alter this line! 

##TRIGGER 

trigger (type) pt (ptl) (pt2) (pt3) (pt4) 

##IGN0RES 

ignore (type) with (pt/eta) between (kinlow) (kinhigh) 
ignore (type) with (pt/eta) over (kinlow) 

##CUTS 

addcut (type) with (variable) between (kinlow) (kinhigh) 
addcut (type) with (variable) over (kinlow) 

Changeable settings are enclosed in parentheses. The explanations for each section (tagging, 
trigger, ignore, and cut) are given below, along with the explanation of each relevant setting 
entry. 

3.3.1 Tagging 

Tagging determines the particle combinations that Seer uses in its calculations. Tagging 
criteria are added after the “Signal” entry in the S6er_CUtS.txt file and before the “Extra” 
entry. When passing events to the cut section for analysis, only events that meet precisely 
the criteria in this list will be included. 

The format for an entry is: 

(nlow) (nhigh) (type), pt (ptlow) (pthigh), eta (etalow) (etahigh) 

These entries are not inclusive to particles that are not listed in the tag list. The seer process 
goes through the list of tagging entries and marks off every particle entry in an event as 
tagged. Once all tagging entries have been identified, the presence of any untagged particle 
in the event listing will result in the entire event being identified as “untagged”. There are 
two ways to make an analysis more inclusive (e.g. inclusive for jets) - add an extra entry of 
0-99 of that particle type with a wide range of kinematics, or add an entry to the ignore list 
(discussed in Section 3.3.3). To make it clear how this works, several examples are included 
below. 

Example 1: This example will only perform an analysis on events that contain between 
1 and 2 leptons (either muons or electrons) that have 10 < pr < 99999 GeV and within 
IpI < 2.5, as well as any number of light jets between 0 and 99 that have 10 < pr < 40 GeV, 
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Table 2. Tagging settings definitions and explanations. 


Parameter 

Explanation 

(nlow) 

This is the minimum number of that type of particle. 

(nhigh) 

This is the maximum number of that type of particle. 

(type) 

This is the type of particle - options include: 
ele - electron 

muo - muon 

lep “ lepton, includes muon and electrons, 
tau - hadronic taus 

Ijt - light jets (does not included 6 -tagged jets or hadronic tau) 
bjt - 6 -tagged jets 

jet ~ all hadronic jets (includes hadronic taus, light jets and bjets) 
gam - photons 

all - all particle types combined 

(ptlow) 

the minimum amount of transverse momentum (in GeV) for that specific 
type of particle 

(pthigh) 

the maximum amount of transverse momentum (in GeV) for that specific 
type of particle (99999 is often used to represent a value large enough that 
there is no upper cutoff in pt) 

(etalow) 

the minimum pseudorapidity for that specific type of particle (typically neg¬ 
ative value) 

(etahigh) 

the maximum pseudorapidity for that specific type of particle (typically 
positive value) 


and within I 77 I <5. Note, the presence of a Th or 6 -tagged jet of any pT, or a light jet with 
Pt > 40 GeV would cause the entire event to be rejected from the analysis. 

##TAGGING 

1 2 lep, pt 10 99999, eta -2.5 2.5 
0 99 Ijt, pt 10 40, eta -5 5 

Example 2: This example will include only events with exactly two high pT light jets 
(200 < Pt < 99999) and then be inclusive to all light jets with pT below 100 GeV. Note the 
double entry of the same type of particle. Seer will tag the items on the list in the order they 
are listed, and in order from high to low pT- 

##TAGGING 

2 2 Ijt, pt 200 99999, eta -2.5 2.5 
0 99 Ijt, pt 10 100, eta -5 5 

3.3.2 Trigger 

Triggers are implemented in Seer as minimum thresholds for pT required for an event to be 
considered for analysis. These are effectively treated as step-function cuts at the threshold 
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values. All triggers listed in seer_cuts.txt are combined, so any given event will be included 
in the analysis if it passes the requirements for at least one trigger. To effectively include 
all events, or rather to turn off the trigger, all of the trigger lines should be commented out 
(double hash tag, preceding the lines). 

The format for trigger entries is: 

trigger (type) pt (ptl) (pt2) (pt3) (pt4) 

The text for “trigger” and “pt” are both necessary and are not settings to be changed. The 
(type) entry is based on the list of possible triggers, not based on the type of particles. 
The currently implemented types of triggers are listed in Table 3. There are up to four pT 
thresholds that can be set. For trigger types of a single particle type (like jets, which can 
have , this should be the pT ordered list of the thresholds. For triggers of a combination of 
types 

3.3.3 Ignore 

Seer has an extra feature called “ignore”, which allows fine tuning adjustment of how Seer 
treats particle entries in an event. This is particularly useful when using a single, inclusive 
particle type in which there are different kinematic limits for each. For example, the ATLAS 
detector can detect electrons with pseudorapidities within |r/| < 2.47, and muons within 
\r]\ < 2.5. Asking Seer to tag general leptons within \r]\ < 2.5 would nominally include any 
electrons with 2.47 < \r]\ < 2.5. Other uses include specifying whether particle information 
is discarded, as in [? ], where it is stated that “Following this step, all jet candidates with 
|? 7 | > 2.8 are discarded.” This indicates that the event is retained even where jet candidates 
occur outside |? 7 | = 2.8, but that the information on those jets are not included in calculations 
of the kinematics that characterize the event. This kind of treatment is not handled in the 
Delphes detector simulation. 

An ignore setting tells Seer to remove a particle listing that meets that criteria from the 
event list, and from all sub-lists within Seer, but does not otherwise change the rest of the 
event. For example, an electron that is ignored is removed from the electron lists, the lepton 
lists, and the inclusive “all” list. Since ignores are run before tagging and trigger, as shown 
in Figure 1, particles that are ignored will not participate in those processes. 

There are two possible entry types for ignore statements, which are: 

ignore (type) with (pt/eta) over (kinlow) 

ignore (type) with (pt/eta) between (kinlow) (kinhigh) 

where text entries not enclosed in parentheses are text strings that are required and should 
not be changed. An “over” statement indicates that any particles with kinematic {pT or 
rj) values larger than the (kinlow) listed will be ignored. A “between” statement indicates 
that any particles with kinematic {pT or rj) values between (kinlow) and (kinhigh) will be 
ignored. In this case, the (eta) is absolute valued. Thus setting eta over 2.0 would be 
\i]\ > 2.0. Further descriptions of the limits for these settings is listed in Table 4. 
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Table 3. Implemented trigger types, with explanations. 


Trigger 

Explanation 

ele 

electron, four numerical options list the pT ordered trigger thresholds (a value 
of 0 turns that option off) 

muo 

muon, four numerical options list the pT ordered trigger thresholds (a value of 

0 turns that option off) 

gam 

photon, four numerical options list the pT ordered trigger thresholds (a value of 

0 turns that option off) 

tau 

hadronic tau, four numerical options list the pT ordered trigger thresholds (a 
value of 0 turns that option off) 

met 

missing transverse energy, only the first numerical option should list the missing 
Et threshold, the rest should be set to 0 

Ijt 

light jet, four numerical options list the pT ordered trigger thresholds (a value 
of 0 turns that option off) 

jet 

jet, includes b tagged jets, four numerical options list the pT ordered trigger 
thresholds (a value of 0 turns that option off) 

bjt 

b tagged jets, four numerical options list the pT ordered trigger thresholds (a 
value of 0 turns that option off) 

tae 

single hadronic tau plus electron, numerical option (ptl) is the pT threshold of 
the Th, numerical option (pt2) is the pT threshold of the electron ((pt3) and 
(pt4) should be set to 0) 

tarn 

single hadronic tau plus muon, numerical option (ptl) is the pT threshold of 
the Tfi, numerical option (pt2) is the pt threshold of the muon ((pt3) and 
(pt4) should be set to 0) 

jat 

single jet (includes b tagged jets) plus missing Et, numerical option is the 

Pt threshold of the jet, numerical option (pt2) is the ^t threshold ((pt3) and 
(pt4) should be set to 0) 

tat 

single Th plus missing Et, numerical option (ptl) is the pT threshold of the Th, 
numerical option (pt2) is the ^t threshold ((pt3) and (pt4) should be set to 
0) 

single electron and single muon, numerical option (ptl) is the pT threshold 
of the electron, and (pt2) is the pT threshold of the muon ((pt3) and (pt4) 
should be set to 0) 

earn 


Table 4. Description and explanation of options for ignore feature. 


Parameter 

(type) 

(pt/eta) 

(kinlow) 

(kinhigh) 


Explanation 

This can be any particle type from the following list; ele, muo, lep, tan, jet, 
Ijt, bjt, gam. 

This is a text string that is either “pt” or “eta” and tells Seer what the 
kinlow and kinhigh values represent. 

This is the lower threshold for the ignore statement. 

This is the upper threshold for the ignore statement. 
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Table 5. Description and explanation of options for addcuts feature. 


Parameter 

(type) 

(variable) 


(kinlow) 

(kinhigh) 


Explanation 

This can be any particle type from the following list: ele, muo, lep, tan, jet, 
Ijt, bjt, gam, met, all. 

For all particle lists (ele through gam in the above list), there is a common 
set of cut variables. For all and met, there are special cut variables. See 
Tables 6 and 7. 

This is the lower threshold for the cut statement. 

This is the upper threshold for the cut statement. 


Special Note: If a user is interested in being inclusive to a particular type of particle 
(assuming the event signature and trigger is based on other particle states), perhaps jets with 
Pt < 30 GeV, this can be done with a tagging statement of e.g. “0 99 jets, pt 0 30, eta -2.5 
2.5”. If the user were to calculate the total Ht of events using this method, these low pT 
jets would be included in the Ht calculation. Alternatively, if a user were to use an ignore 
statement of “ignore jet with pt between 0 30”, then events would be similarly inclusive of 
these jets however all Seer calculations would be blind to these jets - the low px jets would 
not be included in total Ht calculations or any other calculations that involve jets. Thus, a 
user needs to be aware of the effect of using ignore versus tagging statement on inclusiveness 
of events. 

3.3.4 Cuts 

One of the primary functions of Seer is in the application of cuts. Many types of cut options 
have been implemented by default. The addition of user-defined cuts is discussed in Section 
4. This section discusses only the default cuts. To clarify possible confusion between cuts 
and ignores: cuts remove the entire event if the criteria is met, where as ignores remove only 
the specific particle listing within the event if the criteria is met while leaving the rest of the 
event untouched. 

The format for cut entries is: 

addcut (type) with (variable) over (kinlow) 

addcut (type) with (variable) between (kinlow) (kinhigh) 

This format is very similar to the format for the Ignores listed previously. However, there are 
many more options available for the (type) and (variable) listings. Further explanations for 
the constraints on these settings are given in Table 5. Furthermore, the types of (variable) 
options are dependent on the particle (type) chosen. Details of the kinematic cuts available 
for each particle type are listed in Tables 6 and 7. 

Cuts that do not necessarily have a single particle type, cuts that depend on multiple 
types of particles, and cuts that are created by the user, require a (type) setting of “all”. 
The eta, pt, num, ht, im2 and mas cuts, discussed in Table 6 are also implemented for “all”. 
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Table 6. Cut variables common to particle (type) selections: ele, muo, lep, tau, jet, Ijt, bjt and gam. 


Cut Type 

Explanation 

eta 

Pseudorapidity of any particle of that type. If any particle of that type has 
a pseudorapidity value within the specified range, the entire event is cut. 

pt 

Transverse momentum of any particle of that type. If any particle of that 
type has a transverse momentum value within the specified range, the entire 
event is cut. This is the only cut type currently implemented for the “met” 
particle type. 

ptl 

Transverse momentum of only the leading particle of that type. Cuts events 
based only on the pT of the highest pT particle of that type. 

pt2 

Transverse momentum of only the subleading particle of that type. Cuts 
events based only on the pT of the second highest pT particle of that type. 

num 

Multiplicity of particles of that type. Cuts based on the total number of 
particles of that type in the event (this is highly influenced by the use of 
tagging versus ignores for inclusivity). 

ht 

Scalar sum of the pT of all particles of that type. If the total scalar sum of 
the Pt of all particles of that type is within the specified range, the event is 
cut. 

Invariant mass of any combination of two particles of that type (for leptons, 
the pairing must be opposite sign). If any pairing of particles of that type 
has an invariant mass within the range, the event is cut. 

im2 

mas 

Total invariant mass of all particles of that type. If the total invariant mass 
of all particles of that type lies within the range, the event is cut. 


All addition cuts that are also defined only for “all” are listed in Table 7. The “met” setting 
for (type) only has one possible (variable) option, which is “pt”. 

The following are a few examples for cuts that can be implemented: 

Example 1: Removing all events with light jet pT < 50 GeV is accomplished with the line 
addcut Ijt with pt between 0 50. 

Example 2: Removing all events with any combination of dilepton invariant mass within 20 
GeV of the Z mass is accomplished with the line addcut lep with ini2 between 72 112. 
Example 3: Removing all events > 50 GeV is accomplished with the line addcut met 
with pt over 50. 
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Table 7. Cut variables options available only to the (type) “all”. 


Cut Type 

Explanation 

mt2 

Stransverse mass variable. This only works if there are exactly 2 particles in 
the event (a user may use ignores to neglect low energy jets, for example), 
and will cut the event if there are more than 2 particles. It calculates the 
Mt 2 variable using the code developed by [15]. 

maa 

Leading diphoton mass. This first checks to see if there are at least two pho¬ 
tons in the event, and then cuts based on the invariant mass of the leading 
two photons. There is some overlap in purposes between this and the im2 
option in the previous table. For im2, it searches through all possible combi¬ 
nations of diphotons, but will not provide a cut if there is only one diphoton. 
This will cut the event if there is only one photon, and only looks at the 
leading two photons. 

mil 

Same as maa but for leptons. 

mjj 

Same as maa but for jets (all types). 

mtl 

Transverse mass of the only lepton (if only one), or the lowest pT lepton in 
the case of multiple leptons. 

mcol 

Collinear invariant mass of exactly two leptons (there can be any number 
of other particles), including taus. This calculates the invariant mass in 
the assumption that the missing transverse momentum is comprised from 
neutrinos that are collinear with the two leptons. 

osl 

Opposite sign (OS) leptons. Behaviour depends on the value of kinlow (kin- 
high ignored). If kinlow is 0, all events without an OS lepton pairing are cut. 
If kinlow is 1, all events with an OS lepton pairing are cut. 

sfl 

Same flavour, opposite sign (SFOS) leptons. Behaviour depends on the value 
of kinlow (kinhigh ignored). If kinlow is 0, all events without an SFOS lepton 
pairing are cut. If kinlow is 1, all events with an SFOS lepton pairing are 
cut. 

Opposite flavour, opposite sign (OFOS) leptons. Behaviour depends on the 
value of kinlow (kinhigh ignored). If kinlow is 0, all events without an OFOS 
lepton pairing are cut. If kinlow is 1, all events with an OFOS lepton pairing 
are cut. 

ofl 

ssl 

Same sign (SS) leptons. Behaviour depends on the value of kinlow (kinhigh 
ignored). If kinlow is 0, all events without an SS lepton pairing are cut. If 
kinlow is 1, all events with an SS lepton pairing are cut. 
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3.4 seer_plots.txt 

Seer can create either standard one-variable histograms, or can plot two-variable histograms. 
For one-variable histograms, the y-axis contains either cross section values (in fb) or a fraction 
of events, while the x-axis contains the binned kinematic variable value. Note that bins values 
are not divided by bin width, so Seer does not plot da/dx. Two-variable histograms have 
binned kinematic variables on both the x- and y-axes. The way that the bin values are plotted 
in two-variable histograms depends on whether there is one event type added or multiple event 
types. For a single event type (i.e. a single signal file), the histogram is plotted with a colour 
mapping for the cross section, with an associated scale. For multiple event types (i.e. signal 
files plus different backgrounds). Seer plots a pseudo-Dalitz style plot using the box option 
for the TH2 class in ROOT. 

For one-variable histograms. Seer distinguishes between two different kinds of events: 
signal and background. Backgrounds are additive, in that each histogram is the sum of the 
bin values of all smaller cross section histograms, plus the bin values for the cross section of the 
specified background. The order of histograms is thus from largest to smallest background, 
and the bin values for the largest cross section is in fact the sum of all backgrounds, so the 
user does not need to manually combine backgrounds to estimate the total background cross 
section. Backgrounds are plotted with a solid fill colour. 

Files considered to be part of the signal are not additive, but are re-ordered to plot 
from largest total cross section to smallest. In addition, signal histograms do not use a fill 
colour and are only a contour, and are plotted on top of all the backgrounds. Signals are 
distinguished from backgrounds based on the text string in the “tag” entry in the seer_files.txt 
file. 

The relevant entries in the seer_plotS.txt file are, where adjustable settings are enclosed 
in parentheses: 

Disable = (disable) 

PlotType = (plotnum) 

PlotType2 = (plotnuin2) 

PlotMaxX = (xmax) 

PlotMinX = (xmin) 

PlotMaxY = (ymax) 

PlotMinY = (ymin) 

NumBinX = (nbins) 

NumBinY = (nbins) 

ChooseLn = (log) 

Normalize = (norm) 

PlotTxt = (name_string) 

SigPref = (signal_string) 

FileType = (file_type) 

The explanations for each of the entries in this file are given in Table 8. 
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Table 8. Plotting settings definitions and explanations. 


Parameter 

Explanation 

(disable) 

This is a flag to disable the plotting features of Seer (1 = disable, 0 = 
enable). The text output is still produced. Seer has a maximum of 50 
histograms (distinct (tag) entries in seerJiles.txt), and this flag will 
necessarily need to be implemented if this number is exceeded. 

(plotnum) 

This is the kinematic variable that is plotted on the r-axis. The 
list of possible kinematic variables is included as comments in the 
seer_plotS.txt file. The list is too long to include in this document. 

(plotnum2) 

This is the kinematic variable that is plotted on the y-axis, similar 
to (plotnum). A value of 0 turns this feature off, and Seer plots a 
one-variable histogram. 

(xmax) 

This is the maximum value on the x-axis. 

(xmin) 

This is the minimum value on the x-axis. 

(ymax) 

This is the maximum value on the y-axis, which is cross section in 
units of fb. (Future functionality will include typing “auto” for this 
setting.) 

(ymin) 

This is the minimum value on the y-axis, which is cross section in units 
of fb. (Future functionality will include typing “auto” for this setting.) 

(nbinx) 

This is the number of bins to plot between (xmin) and (xmax). 

(nbiny) 

This is the number of bins to plot between (ymin) and (ymax). 

(log) 

Only considers values of 0 or 1. A value of 1 sets the y-axis or z-axis 
to use a logscale. A value of 0 sets the y-axis or z-axis to use a linear 
scale. 

(norm) 

Only considers values of 0 or 1. A value of 1 normalizes the binned 
events to the total binned cross section, and the y-axis (z-axis) contains 
event fraction values. A value of 0 sets the y-axis (z-axis) to plot cross 
section values. 

(name_string) 

This adds a text string to the output file names. Useful for distin¬ 
guishing between subsequent runs with identical files, but different 
settings. 

(signal_string) 

This is the identifying text of the prefix of the signal file tags. For 
example, if the tag line in seerJiles.txt for all signal files is “sig”, this 
setting should be set to “sig”. All such files will be plotted differently, 
as discussed above. 

(file.type) 

This tells Seer the file extension for the produced histogram. Common 
options are “pdf’, “eps” and “png”. Possible options are the same as 
the types of files known to ROOT. 
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3.5 seer_fakes.txt 


For event simulators, fake rates are typically handled internally. In Delphes, there is a prob¬ 
ability that a jet will fake an electron, for example. Fake backgrounds can have event rates 
similar to that of rare signals if there is a combination of a very large background with a 
very small fake rate. Estimating the cross section that overlaps with the signal can be very 
challenging using events from the detector simulator; if the fake rate is 1/1000, then out of 
50000 events generated, approximately 50 events will be useful. Generating enough statistics 
thus takes a long time. 

To improve on this. Seer has a fake-rate multiplier implemented. In effect, it allows for 
the possibility of using a much larger event fraction while simulating the effect of the fake 
rate as a re-scaling of the overall cross section. For example, if a user is interested in the 
contribution of VF-|-jets to e+e" signals, and has generated events of W + nj, the cross section 
can be rescaled by the fake rate, and one of the jets can be randomly selected to be added to 
the electron lists. 

Note: This can over-estimate the fake background if the fake rate is included 
in both the detector simulator and Seer. Thus, if Seer is going to handle fake 
rates, it is required that the fake rate be turned off in the detector simulator. 

Control over this process is handled by the S6erJakes.txt file. This file is structured as 
follows, where all setting entries are enclosed in parentheses: 

Rates 

(rstate) (fstate) (efficiency) 

Schema 
scheme 0 
scheme 1 

addreal (nlow) (nhigh) (rstate), pt (ptlow) (pthigh), eta (etalow) (etahigh) 

addfake (rstate) (fstate), pt (ptlow) (pthigh), eta (etalow) (etahigh) 

scheme 2 

scheme 3 

scheme 4 

scheme 5 

scheme 6 

scheme 7 

scheme 8 

scheme 9 

There are 10 total fake schemes that can be implemented. It is recommended that scheme 
0 be left empty, as a “no fakes” option. Note that the scheme number is the same number 
that should accompany the file in seerJiles.txt. These setting entries are defined in Table 
9. There can only be one fake rate defined (under Rates) for each combination of (rstate) 
and (fstate), but there can be multiple entries of the same combination of (rstate) and 
(fstate) as an addfake. 
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Table 9. Fake rate setting definitions and explanations. 


Parameter 

Explanation 

(rstate) 

This is a real particle state from the list of lep, ele, muo, bjt, gam, tau, 
Ijt. 

This is a fake particle state, where the possibilities are limited depending 
on the corresponding rstate. Real light jets (Ijt) can fake ele, tau, bjt 
and gam. Real photons (gam) can fake ele. Real electrons (ele) can fake 
photons. 

(fstate) 

(efficiency) 

This is the probability of the rstate faking the fstate. 

(nlow) 

This is the minimum number of a real particle 

(nhigh) 

This is the maximum number of a real particle 

(ptlow) 

This is the minimum px of a real particle 

(pthigh) 

This is the maximum pT of a real particle 

(etalow) 

This is the minimum ry of a real particle (etalow = -etamax for symmetric) 

(etahigh) 

This is the maximum ry of a real particle 


The “addreal” entries list the requirements needed for Seer to consider the faking of an 
event. In the example of IT + nj faking the assumption is that one electron comes 

from the decay of the W and the other one is a faked light jet. Thus, the user would set 
addreal 1 1 ele, pt 10 99999, eta -2.5 2.5, which will look for any event that already 
has exactly one electron (if this is the only addreal, then the other states are inclusive) within 
the specified kinematic range. If one is found, then Seer will pass the event along to determine 
the fake particle. 

The “addfake” entries determines how Seer fakes the state. First, Seer will look among 
all of the (rstate) particles in the event and identify which ones meet the kinematics listed. 
It will then randomly select a particle from that list to turn into the (f state) type, removing 
the particle from the appropriate lists and adding it to the fake state hst(s). In the example of 
W + nj faking e'’'e“, the user would set addfake Ijt ele, pt 10 99999, eta -2.5 2.5, 
which will take one of the light jets from within that kinematic range and turn it into an 
electron. In the case of an electron, the charge of the electron is determined randomly, as 
well. Thus, it is possible that a fake electron from W + nj will result in either same sign 
electrons or opposite sign electrons. 

It is important to note the event weight for all events within the file, whether a faked 
particle is present in the event or not, is scaled by the product of all the fake rate efficiencies 
for each “addfake” entry in the scheme being used for that file. Continuing with the same 
example, if the fake rate of jets as electrons is 0.0001, then any hie that uses that scheme will 
rescale the total cross section by 0.0001. Thus, any instances of e'’'e“ within those hies will 
have their weight rescaled. This is why processes that fake the signal need to be separated 
from processes that produce the signal when using the Seer faking system, and why fake rates 
need to be turned off at the detector simulator level. 
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3.6 seer_settings.txt 

There are a number of constants that Seer uses for various calculations. The numerical value 
of these settings can be adjusted in the S66r_settings.txt file. The format for this file is: 


mlsp = 

(mass_lsp) 

wmas = 

(W_mass) 

wwid = 

(W_width) 

zmas = 

(Z_mass) 

zwid = 

(Z_width) 

tmas = 

(t_mass) 

twid = 

(t_width) 

jeer = 

(jE_smear) 

jmer = 

(jM_smear) 

jher = 

(jh_smear) 

metA = 

(MET_scale) 

metP = 

(MET_pileup) 

metM = 

(MET_minbias) 


Further information about these settings, as well as the default values for them, is listed in 
Table 10. 

The Mt 2 calculator included with Seer requires an initial guess of the mass of the LSP 
particle, which is set in this file. The W, Z, and t mass settings are currently not used within 
any calculations. However, their inputs may be useful for any tagging or kinematics that 
depend on this values. An example where this may be useful would be the cuts employed in 
the ATLAS tri-lepton search for supersymmetry, where the phase space is separated into bins 
based on the dilepton invariant mass being smaller, larger or within a window about the Z 
pole mass. 

Additionally, Seer allows for the possibility of adding greater smearing to the jets and 
missing transverse energy values. All smearing values are calculated using a Gaussian dis¬ 
tributed random value centred at zero with widths dependent on the type of jet being 
smeared. For hadronic jets (applies to light jets, h jets and Th jets), additional jet pT will 
be smeared with a width of ge = (jE.smear) x jet mass will be smeared with a width 
= {jM.smear) x mj (useful particularly for large R fat jets), and the p and 4> values 
will be independently smeared with a width gr = (jh.smear) x p{4>) (where (jE.smear), 
(jM_smear) and (jh.smear) are the relevant quantities from S6er_S6ttings.txt). For missing 
transverse energy, smearing is performed independently on the x and y components with a 
width given by the formula from [18]: 

Gjf^ = A(0.40 + 0.09\/p) Et + PxM (3.2) 

where A is an overall scaling factor given in Seer by the variable metA (used to account for 
existing smearing in the detector analysis), P is the average number of pileup events given 
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Table 10. Explanations and default values for the 


Parameter 

Default 

Explanation 

(mass.lsp) 

0 

sets the value of the mlsp variable (mass of the LSP setting 
used for MT2 calculations) 

(Wunass) 

80.385 

sets the value of the wmass variable (currently not used) 

(W_width) 

2.085 

sets the value of the wwidth variable (currently not used) 

(Zunass) 

91.1876 

sets the value of the zmass variable (currently not used) 

(Z.width) 

2.4952 

sets the value of the zwidth variable (currently not used) 

(tjnass) 

173.21 

sets the value of the tmass variable (currently not used) 

(t.width) 

2.00 

sets the value of the twidth variable (currently not used) 

(jE.smear) 

0.00 

sets the value of the jet_E_error variable (used to add additional 
Pt smearing) 

(jM_smear) 

0.00 

sets the value of the jet_m_error variable (used to add addi¬ 
tional jet mass smearing) 

(jh_smear) 

0.00 

sets the value of the jet_eta_error variable (used to add addi¬ 
tional rj/cj) smearing) 

(MET_scale) 

0.75 

sets the value of the met_scale variable (used to scale MET 
smearing) 

(MET.pileup) 

30 

sets the value of the met.pileup variable (average of pileup 
events for MET smearing) 

(MET_m inbias) 

20 

sets the value of the met_min_bias variable in GeV (minimum 
bias energy for MET smearing) 


in Seer by the variable metP, and M is a minimum bias energy given in Seer by the variable 
metM. The sum of visible transverse energies should be in GeV, as should the value of M. 
The numeric factors are fitted values from [18]. 

3.7 Command Line Input/Output 

Seer is run using the command ./seer.exe in the main folder. The seer terminal output 
provides most of the same information as is in the text file output with a few minor differences. 
The differences are: 

• The descriptions of the settings files are more descriptive, rather than an identical copy 
of the lines in the file. 

• Individual file results are not listed, only group summaries and the overall summary is 
listed. 

• The histogram information is not listed. 

The information in the text output file is meant to be useful for recreating the analysis, 
while the terminal output is meant to be a more descriptive explanation and summary of the 
analysis. 
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There are three possible arguments that can be input (individually or in any combination) 
which will force Seer to run with extra text output to the terminal (not to the output text 
file). These three arguments represent three modes: 

• debugmode - used to print out information about nearly every step of the analysis 
process. This is useful to use when Seer crashed mid-run to determine where the problem 
lies. 

• rejectmode - used to print out a detailed output of all particles within an event only 
when that event is rejected. This is useful for analyzing each event that is cut to 
determine if the cuts are working as intended. 

• verbosemode - turns on output of the individual file summaries into the terminal output, 
and output of an initial verification of the cross section and event counts in the LHCO 
files. 

Note that the debugmode and rejectmode modes produce an excessive amount of text, and 
it is recommended that they be used only in conjunction with a test LHCO file that contains 
around 10 events. Creating such an LHCO file is easy and does not require modifying the 
cross section or event count entries within the file. Such a test file can be created by making 
a copy of any of the signal or background files being examined and highlighting and deleting 
all but the first 10 events. (Note: deleting the extra carriage return at the end of the LHCO 
file can result in problems within Seer.) 

4 Modifying Seer 

There are two aspects of Seer that a user may wish to modify. The first is user-defined cuts, 
and the second is user-defined plot variables. Both of these require an understanding of how 
the information is stored in Seer, and then an understanding of how that information will be 
interpreted and used within Seer. This will be discussed in this section. 

4.1 Accessing information in calculations.cpp 

Seer stores the information for each particle within a series of vectors of the TLorentzVector 
class in ROOT. This information is stored only for the current event, and is cleared upon 
reading in a new event. The following vectors of particles are available: 


std:;vector<TLorentzVector> 
std:;vector<TLorentzVector> 
jets) 

std:;vector<TLorentzVector> 
std:;vector<TLorentzVector> 
std:;vector<TLorentzVector> 
std:;vector<TLorentzVector> 


v_gam; 

// 

v_jet; 

// 

v_bjt; 

// 

v_ljt; 

// 

v_tau; 

// 

v_lep; 

// 


photons 

all jets (includes b-tagged and hadronic tau 
only b-tagged jets 

only non-b-tagged and non-hadronic tau jets 

only hadronic taus 

both electrons euid muons 
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std:;vector<TLorentzVector> v_liie; // 
electrons and muons) 
std:;vector<TLorentzVector> v_lpo; // 
electrons and muons) 
std::vector<TLorentzVector> v_ele; // 
std::vector<TLorentzVector> v_ene; // 
std::vector<TLorentzVector> v_epo; // 
std:: vector<int> v_ele_charge; // the 
order as v_ele 

std::vector<TLorentzVector> v_muo; // 
std::vector<TLorentzVector> v_mne; // 
std::vector<TLorentzVector> v_mpo; // 
std:: vector<int> v_muo_charge; // the 
as v_muo 

std::vector<TLorentzVector> v_met; // 
std::vector<TLorentzVector> v_all; // 
std:: vector<int> v_lep_charge; // the 
as v_lep 


only negatively charged leptons (both 
only positively charged leptons (both 
all electrons 

only negatively charged electrons 
only positively charged electrons 
charge of electrons in v_ele, in the same 

all muons 

only negatively charged muons 
only positively charged muons 
charge of muons in v_muo, in the sernie order 

missing treuisverse energy 

all particles, except missing tranverse energy 
charge of leptons in v_lep, in the same order 


There are six types of particle entries in LHCO files, based on the number in the second 
column for the particle listing (referred to as tag). These options are 0 (photon), 1 (electron), 

2 (muon), 3 (hadronic tau), 4 (jet), and 6 (missing transverse energy). Furthermore, the 
sign of the entry for number of tracks indicates the charge for electrons and muons, and the 
b-tag field indicates whether a jet is a b-tagged jet. This is the information that is used to 
determine which vectors to add the particle entries. These vectors are not exclusive, such 
that almost every particle entry exists in multiple lists. The reason for this methodology is 
to make modifying and adding new calculations more easy for users, at the expense of being 
more memory intensive. 

Users have full access to the TLorentzVector functions for each of the vectors of these ob¬ 
jects. The list of functions is available on https: //root. corn. ch/root/html/TLorentzVector. 
html. 

Note: If the user defines their own vector class object, this should be cleared within the 
event_destroy() function in calculations.Cpp. 

4.2 User defined kinematic variables 

The following steps will walk users through the process of adding code to calculate a new 
kinematic variable to Seer. 

Step 1: The function needs to be defined within calculations.h. For an example, a calculation 
of the leading dijet plus leading lepton invariant mass is discussed through the rest of 
these steps. The function could be defined with the code: 

virtual double calc_Mjjl(); 
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Step 2: Following this, the function needs to be dehned within calculations.cpp after the text 
“This area is for user defined functions.” Templates are provided in the code to under¬ 
stand the format. Continuing with the example, the code could be: 

double event::calc_Mjjl0 { 

if ( v_jet.size() >= 2 && v_lep.size() >=1 ) { 

return (v_jet.at(0)+v_jet.at(l)+v_lep.at(0)).M(); 

> 

return -99999; 

} 


In this example, a few important things should be noted: 

(a) The function must first check that the jet and lepton vectors have the appropri¬ 
ate size or the user will have errors during calculation that will stop Seer from 
functioning. 

(b) The particle vectors are ordered from largest to smallest pT, and v_jet.at(0) and 
v_jet.at(l) are the leading and subleading jet. For other possible combinations 
of jets from a larger list of jets, the user will need to define boolean conditions to 
select the desired jets from the list. 

(c) This example does not distinguish between b-tagged jets, hadronic tau jets, and 
light jets. To use only light jets, the user could instead use v_ljt instead of v_jet. 

(d) Returning a value of 0 if there are insufficient jets and leptons to calculate a mass 
can result in the 0th bin being incorrectly filled with events that should not be 
plotted. Using a value that is unreasonable for the plotting scale will ensure that 
such events do not appear on the histogram. 

Step 3: To add plotting capabilities, allowing this new function as a plotting variable, two 
changes must be made. The first is to add the function to the list of variables output by 
the function calc_ps(int hist_type) within calculations.Cpp. The function needs to be 
added to the end of the list of else-if statements with an incremented reference number 
(see Section 3.4 for more details on the reference number, (plotnum)). Continuing with 
the example, the code could look like: 

else if ( liist_type == 49 ) { 
event_ps = calc_Mjjl(); 

} 


Step 4: The second change to allow plotting requires modifying the plots.cpp file. The area to 
modify, along with a template, is provided following the text string “USER AREA.” 
Using the same (plotnum) value from calc_psO, the name_text, title_text, yaxis_text 
and xaxis_text strings must be modified as needed. The name_text variable is a string 
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that will be added to the filename of the output files (see Section 3.4 for more infor¬ 
mation). The title_text variable is a string that will nominally be added as a title 
to the ROOT canvas, however this is currently disabled in the style file, Style.h. The 
yaxis_text and xaxis_text are ROOT-LaTeX strings that will become the labels on 
the y and x axes, respectively. Currently, the y-axis in Seer output plots is only cross 
section values in units of fb, so the yaxis_text variable does not need to be changed. 
Continuing with the example, this code could look like: 

else if ( plot_type == 49 ) { 
name_text = "mjjl"; 
title_text = "Dijet+Lepton Mass"; 
yaxis_text = "#sigma (fb)" ; 
xaxis_text = (GeV)"; 

> 


At this point, all necessary changes have been made to create plots of with this variable. 
The number 49 is now an accessible (plotnum) in S6er_plotS.txt. This kinematic variable can 
also be implemented for use as a cut calculation, but this is a more nuanced issue that will 
be discussed in the next section. 

4.3 User defined cuts 

User defined cuts can be implemented using two methods. The first is using a boolean function 
that cuts based on some (complex) criteria. The second is by using a kinematic function. 
The key to implementing cuts is in setting both the cuttest and intcuttest variables to be 
true if the condition is met. The logic used in Seer is that if cuttest is TRUE, the event will 
not be included in the analysis, and it will not be passed to the plotting function. 

The variable intcuttest is reset for each run through the different cuts implemented, dis¬ 
tinguishing which cut settings eliminate the event (since an event may be cut by multiple cut 
conditions). Alternatively, cuttest is not reset through each cut, and so it simply represents 
an overall test whether the event is cut. This is why both variables must be set to be TRUE. 
The following steps outline the procedure to define a boolean function to Seer. 

Step 1: The function needs to be defined within calculations.h. The naming scheme for boolean 
functions used for cuts is a prefix of cut_. As an example, a function that cuts based on 
number of jets with a complex combination of pT thresholds is discussed through the 
rest of these steps. The function could be defined with the code: 

virtual bool cut_jets(); 


Note that this function does not make use of the (kinlow) and (kinhigh) variables 
from seer_cuts.txt, and is therefore a hard coded function. Users may wish to access 
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information from (kinlow) and (kinhigh) for a more nuanced cut. To do this, the 
user could define the function as: 

virtual bool cut_jets (double kin_low, double kin_high); 


If the user expects integer input for different cases, based only on the value of (kinlow), 
the function could be defined as: 

virtual bool cut_jets(int kin_low); 


Note that this would require recasting the double valued kin_low variable as an integer 
at the point of use of the function. The following steps assume that the second of these 
three function definitions is implemented. 

Step 2: Following this, the function needs to be defined within calculations.cpp after the text 
“This area is for user defined functions.” Templates are provided in the code to under¬ 
stand the format. Continuing with the example, the code could be: 

bool event::cut_jets (double kin_low, double kin_high) { 
if ( v_jets. sizeO >= 4 && v_tau.size() == 0 ) { 

if ( v_jet.at(0).Pt() > kin_high && v_jet.at(l).Pt() > kin_low && 
v_jet.at(3).Pt0 > 50 ) { 
return true; 

> 

> 

return false; 


For this code, a few important things should be noted: 

(a) This function requires more than 4 jets with 0 tau jets, thus it is inclusive to 
b-tagged jets and light jets, but not hadronic taus. 

(b) This function requires that the leading jet has a pT larger than the value of kin_high, 
and a subleading jet with a pT larger than the value of kin_low. 

(c) This function requires that the fourth jet (v_jet.at(3)) has a pT > 50 GeV. Since 
the jet list is pT ordered, this also means that the third jet (v_jet.at(2)) will have 
a Pt > 50 GeV. 

Step 3: Following the definition of the function, the cut must be implemented within the 
run_cut() function in calculations.cpp. Templates have been provided following the 
text string “This area is for user defined cuts.” Adding the cut first requires defining 
a unique string that will be used to identify the cuts in seer_cuts.txt, which will be 
tested against the variable kin_type. The second issue requires understanding exactly 
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the condition of the function which will result in cuts. This will be more clear in the 
context of continuing the example, which can be implemented with the code: 


if ( kin_type == "myjet" ) { 

if ( !cut_jets(kin_low,kin_high) ) { 
if ( debugmode ) { 

std::cout << "Event rejected because of a myjet cut" << std::endl; 

> 

cuttest = true; 
intcuttest = true; 

> 

} 


A few important things should be noted: 

(a) This script sets cuttest to TRUE if the return of the cut_jets() function is FALSE. 
This is an important distinction. Thus, events that don’t have jets falling within 
the kinematic constraints defined in the function will be removed. Close attention 
to the logic is necessary to properly code these cuts. When cuttest is TRUE, the 
event is removed. 

(b) The debugmode feature has been implemented to address the complexity of these 
cuts. Running in debugmode on a reduced sample of events (~ 10 events is a good 
number) is useful for testing the behaviour. 

(c) This is not optimized coding, written in order to make understanding the behaviour 
easier. 

The procedure for performing a cut on user defined variables is very similar. Steps 1 and 
2 from the previous section need to be followed to create the kinematic calculation function. 
Following this, Step 3 from this section outlines the basic procedure to implement the cut. 
There is a template provided within the function run_cut() in calculations.cpp. For the 
example of the dijet plus lepton invariant mass function defined in the previous section, the 
following code would implement a cut based on the kin_low and kinJiigh values: 

if ( kin_type == "JJL" ) { 

if ( v_jets.size0 >= 2 && v_lep.size() >= 1 ) { 
testvalue = calc_Mjjl(); 

if ( testvalue > kin_low && testvalue < kin_high ) { 
if ( debugmode ) { 

std::cout « "Event rejected because of a JJL cut of any particles with 
" << testvalue « std::endl; 

} 

cuttest = true; 
intcuttest = true; 

> 


- 29 ~ 





> 

else { // This else statement will result in removal of all events that do not 
have at least 2 jets and at least 1 lepton, 
cuttest = true; 
intcuttest = true; 

} 

> 


In this code, the calculated kinematic variable is passed to the testvalue variable. This 
is to save on computation time, which is especially important for more complex codes like 
the stransverse mass, calc_MT2(), and razor, calc_MR(), functions, which would add significant 
computing time to compute once for each of the comparisons to kinJow and kin_high. Also note 
that the dijet plus lepton mass can only be calculated on events with at least two jets and at least 
one lepton, so an extra logical test has been added to cut events that don’t meet these minimum 
requirements. 

In either of the two cases presented, adding these functions in the region outlined opens up the 
ability to add addcut lines to the seer_cuts.txt file such as: 

addcut all with myjet between 300 400 
addcut all with JJL between 0 500 

Based on the definition of the myjet cut, the first line would remove event if the there were fewer than 
4 jets, the leading jet had a pr < 400 GeV, the subleading jet had a pr < 300 GeV, or if the third 
and fourth leading jets had pt < 50 GeV. Any or multiple of those conditions would result in removal 
of the event from the analysis. The JJL cut is more straightforward, as it will remove events in which 
the dijet plus lepton invariant mass is between 0 and 500 GeV. 

4.4 Other modifications to Seer 

Users may wish to make other modifications to Seer. A few of these changes will be addressed here. 

4.4.1 Changes to histogram colour schemes 

Currently, Seer checks the number of histograms to plot and chooses from a few spectral choices. 
For a different colour scheme, users can alter the settings within the constructor of the plot class 
within plots.cpp. The colours available in ROOT are listed at https://root.cern.ch/root/html/ 
TColorWheel.html. The three different colour scenarios are for 1-9 histograms, 9-15 histograms, and 
more than 15 histograms. It is not recommended that users plot more than 15 histograms, however, 
except in very specialized situations, as the lines plots will become increasingly busy and difficult to 
read. However, Seer can effectively handle up to 50 overlapping histograms. 

4.4.2 Changing canvas/axis settings 

The canvas and axis for the output histogram is set at the end of the constructor of the plot class 
within plots.cpp. This area is labeled by a comment string stating “Canvas/Axis settings”. This area 
controls the offset and size of the axis labels, as well as the axis tick settings. Some default settings 
also exist within the Style.h file. Modifications to Style.h should be handled by users who are very 
experienced with ROOT. See the ROOT documentation on root.cern.ch for more information on 
each of these settings. 
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4.4.3 Changing the legend 

The upper right corner of the legend defaults to the upper-right-most corner of the canvas, and has a 
width of 19% of the canvas. The height of the legend is the larger value of 20% or 3.5% x nuist, where 
nhist is the number of histograms included. Changes to the legend location and size can be made at 
the start of the generate_plot () function within the plot class in plots.cpp. 

5 Examples 

5.1 Analysis Prototyping and Data Visualization 

One of Seer’s primary functions is for analysis prototyping and data visualization, where kinematic 
distributions of signals and backgrounds can be explored with minimal effort. By adjusting a few set¬ 
tings files, many different tagging permutations and cuts can be implemented and visualized, producing 
publication quality figures. 

In a recent study of light pseudoscalar production at LHC13 [19], Seer was used to analyze 
SM and beyond the SM production of dilepton events in association with b jets generated with 
MG5-|-aMC@NLO. A total of 40.7 million events were generated, and heavy states {W, Z, a, t) 
were decayed democratically over all lepton generations (e, /r, r). Intermediate figures from this 
study are included in Figure 2, which shows di-muon event production at the LHC with at least 1 
6-tagged jet, and inclusive to light jets with pt < 40 GeV. Backgrounds of 'y* jZ + Q — 2b + Q — 2j, 
W~^W~ + 0 — 2b + 0 — 2j, W + 0 — 2b + 0 — 2j and WZ -|- 0 — 25 -|- 0 — 2j are included, as well as 
for a BSM pseudoscalar with an enhanced coupling to down-type fermions over SM Higgs couplings, 
for masses of the pseudoscalar rua = 50,80 GeV. Dimuon decays arise from direct dimuon production, 
but also as decay products from T+r“ production. 

To illustrate how Seer was used for this project, the contents of the settings files are listed below. 
The contents of the seer_files.txt file was: 

## Single W plus 0-2 bjets plus light jets 

W+0-2b+jets d 1 ./Ihcofiles/bkgs/wmbb/run_01.Ihco 5.81426 41257 
W+0-2b+jets d 1 ./Ihcofiles/bkgs/wpbb/run_01.Ihco 9.76014 40566 
## W+Z plus 0-2 bjets 

WZ+0-2b+jets d 0 ./Ihcofiles/bkgs/wmzbb/run_01.Ihco 0.000426996 50000 

WZ+0-2b+jets d 0 ./Ihcofiles/bkgs/wpzbb/run_01.Ihco 0.000693746 50000 

## WW plus 0-2 bjets, includes ttbar production 

WW+0-2b+jets d 0 ./Ihcofiles/bkgs/wwbb/run_01.Ihco 3.8891 50000 

## gcunma^/Z plus 0-2 bjets plus jets 

Z+0-2b+jets d 0 ./Ihcofiles/bkgs/llbb/run_01.Ihco 1.35258 32926 
## Signal files. 

MA=80,gd=25 d 0 ./Ihcofiles/signal/set_80_5_l.Ihco 0.923511 50000 
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Figure 2. Kinematic distributions of the signal and background for pseudoscalar mediated dilepton 
production, including backgrounds, with a signature, with no cuts made. Upper left figure is 

the total scalar sum of visible transverse momentum, upper right figure is the leading lepton pT, and 
lower figure is the scalar sum of the lepton px and the ^t- Distributions show distinctive differences 
in the distributions between signal and background. 


The ellipses refer to multiple entries of similar files. For the backgrounds, events were broken down 
into subsets of diagrams, which necessitates overriding the automatic event weight calculator in Seer 
and adding the cross section (in pb) and number of events at the end of each line (note the use of 
the d flag when overriding the automatic event weight calculator). Backgrounds involving a single W 
decaying to leptons do not meet the dilepton criteria, and so a light jet must fake either an electron or 
a tau lepton to contribute to the signal. This was calculated by using Seer to estimate fakes instead 
of Delphes. 

Dimuon events do not suffer from large fake lepton backgrounds, but 6-jet faking was a significant 
issue. For the e/r analysis, however, jets were considered to fake electrons, and so the single lU-l-jets 
channel could produce events that faked the signal. In this case, the flag of 1 for the single W 
backgrounds shown above corresponded to scheme 1 of seer_fakes.txt, which contained: 

Rates 

jet ele 0.0001 
jet tau 0.001 
jet bjt 0.001 
Schema 
scheme 0 
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scheme 1 

addreal 1 1 muo, pt 10 99999, eta -2.5 2.5 

addreal 1 2 bjt, pt 10 99999, eta -2.5 2.5 

addfake jet ele, pt 10 99999, eta -2.5 2.5 

scheme 2 

This is the entry for the e/x analysis, where the muon is expected from the decay of the W, and the 
analysis requires that there is at least 1 6-jet. For events without any 6-jets, a second entry of addfake 

jet bjt, pt 10 99999, eta -2.5 2.5 can be added to the same scheme. For each event that meets 

that criteria, these settings instruct Seer to randomly select a jet with px > 10 GeV and \rj\ < 2.5 and 

switch its type to an electron. The fake rate for this is ^ = 0.0001, and this factor is then applied to 

all such events. Not included here for brevity is a similar inclusion of dilepton events with zero 6-jets, 
in which a light jet fakes a 6-jet. 

The seer_cuts.txt file for the dimuon analysis was as follows: 

## New Cut Card - double number signs at the start of the line indicate a comment 

## Note: B-jets and hadronic taus are also considered jets unless they are explicitly tagged/removed 
Signal ## Do not alter this line! 

2 2 muo, pt 10 99999, eta -2.5 2.5 

1 2 bjt, pt 10 99999, eta -2.5 2.5 

Extra ## Do not alter this line! 

##TH1S IS CMS TRIGGER 
trigger ele pt 35 0 0 0 
trigger ele pt 23 12 0 0 
trigger muo pt 25 0 0 0 
trigger muo pt 17 10 0 0 
trigger gam pt 80 0 0 0 
trigger garni pt 40 25 0 0 

trigger tau pt 59 59 0 0 

trigger jet pt 657 000 
trigger jet pt 247 247 247 0 
trigger jet pt 113 113 113 113 
trigger tae pt 45 19 0 0 

trigger taim pt 40 15 0 0 

trigger jat pt 180 123 0 0 
trigger tat pt 86 65 0 0 
trigger bjt pt 237 000 
trigger eaim pt 23 10 0 0 

trigger eaim pt 12 23 0 0 

##1GN0RES 

ignore ele with pt between 0 10 
ignore ele with eta over 2.47 
ignore muo with pt between 0 10 
ignore muo with eta over 2.4 
ignore gam with pt between 0 10 
ignore gam with eta over 2.5 
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##CUTS 






addcut 

all 

with 

osl 

over 0 


##Below are the 

cuts for the 

2u 

addcut 
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with 
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addcut 

all 

with 

ht 

over 120 


addcut 

all 

with 

htl 

over 120 



The signal tagging required exactly two muons and either one or two 6-jets. An estimate of the 
CMS trigger thresholds was included, and detector limitations for electrons, muons and jets were 
implemented in the ignore section. 

The cuts included in this list are: an opposite sign lepton cut (eliminating same-sign lepton pairs); 
a pr < 50 GeV on the leading lepton cut (events with pT > 50 GeV were removed); a total scalar sum 
of pt of visible objects of Hx <120 GeV cut; and a scalar sum of the missing transverse momentum 
and Pt of the leptons cut of < 120. 

The seer_plotS.txt file for Figure 3 included the settings: 

PlotType = 35 
PlotType2 = 0 
PlotMaxX = 150 
PlotMinX = 0 
PlotMaxY = 10000 
PlotMinY = 0.0001 
NumBinX = 150 
NumBinY = 30 
ChooseLn = 1 
Normalize = 0 
PlotTxt = lulul 
SigPref = MA 
FileType = pdf 

Setting PlotType2 to 0 creates a single variable histogram, and the NumBinX value sets the bin width 
to be 1 GeV. Based on the SigPref entry, all entries in seerJiles.txt that have a tag starting with 
“MA” are treated as signal files and plotted as outlines, while all others are treated as backgrounds. 
Based on the FileType entry, the histogram was output as a portable document format (pdf) file. 

With the small bin sizes in Figure 3, the event rates per bin show a significant statistical fluctuation 
in bins where few backgrounds passed the tagging and cut requirements. This motivated the need for 
generating 40.7 million events, in order to get an accurate estimate of the backgrounds in each bin. 
For the ROOT file format, the file size for 50k events is larger than 500 MB. Performing this same 
analysis using information stored in ROOT files would require more than 400 GB of disk space. The 
time taken for the generation of these plots was approximately 20 minutes (MacBook Pro 2.66GHz i7, 
8GB 1066 DDR3 RAM, 5400 RPM HD). 

Seer also allows for alternative ways of visualizing the events. The plots in Figure 4 were produced 
by making a few changes to the settings files in Seer, but using the same input files. The signal is 
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Figure 3. Invariant mass distribution of pseudoscalar mediated fj,'^ production, as well as SM 
backgrounds, with cuts made to enhance signal over the background. 




Figure 4. Examples of the types of 2D histograms that can be generated by Seer. The box histogram 
(top) is produced when multiple tags are included in the seer_files.txt list. Box sizes represent the 
relative bin value. The coloured spectral histograms (lower) are produced when only a single tag is 
listed in the seer_files.txt list. Without setting Seer to produce normalized cross sections, the z-axis 
values are a values measured in fb. 


switched to the e + fi channel, which does not have the large invariant mass peak, by adjusting the 
appropriate signal lines in the seer_cuts.txt file, shown below: 

Signal ## Do not alter this line! 
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1 1 ele, pt 10 99999, eta -2.5 2.5 

1 1 muo, pt 10 99999, eta -2.5 2.5 

1 2 bjt, pt 10 99999, eta -2.5 2.5 

Using this tagging, Figure 4 was produced by combining all four background types into a single tag 
(bkg), and setting seer_plotS.txt to: 

PlotType = 35 
PlotType2 = 5 
PlotMaxX = 300 
PlotMinX = 0 
PlotMaxY = 300 
PlotMinY = 0 
NumBinX = 30 
NumBinY = 30 
ChooseLn = 1 
Normalize = 0 
PlotTxt = lelu 
SigPref = MA 
FileType = pdf 

Additionally, all of the backgrounds were given an identical tag in seerJiles.txt, and only one signal 
set was included. The top of Figure 4 was produced with both the signal and background entries 
in seer_files.txt, while the lower two were produced by commenting out the signal or background, 
respectively. This visual representation of the phase space is useful for identifying cut regions. 

5.2 Reproducing Experimental Analyses 

Seer can also be used to test MC signal data against existing experimental results. As with any 
such endeavour, the precise details of the detector simulation (such as in Delphes) and method for 
accounting for NLO effects can have a signihcant impact on the results. Included in this section are 
the results of using Seer to reproduce three experimental SUSY searches: 2-6 jet inclusive search [20], 
tri-lepton search [21], di-lepton search [22]. The routines to perform the cuts for each of these studies 
are implemented in Seer already in calculations.cpp, allowing users to perform these same searches. 
In each case, the details of the Delphes settings used are not included, as this is outside the scope of 
Seer. 

Another challenge in reproducing experimental searches is in accounting for combined exclusion 
limits. The experimental groups typically use the CLg method for determining exclusion constraints, 
which accounts for signal excesses from multiple signal regions, and accounts for correlations between 
systematic uncertainties present. The correlation matrix for the systematic uncertainties are not 
currently provided by any of the experimental groups, and so theorists are unable to perform this 
same analysis. In the studies reproduced here, exclusion contours were generated by assuming a 
boolean addition of the exclusions from each independent signal region. This naturally results in a 
more conservative contour, leading to a natural negative bias, especially near regions that are excluded 
by multiple signal regions simultaneously. 

5.2.1 ATLAS 2-6 jet inclusive search 

The 2-6 jet inclusive search [20] that was reproduced involved the mSUGRA scan over mi /2 vs mo, 
with tan j5 = 30, Aq = —2mo and /i > 0. Contrary to the conference note, which used Herwig-|--I- 
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to generate events, this reproduction employed MG5+aMC@NLO to generate 50k events of the fully 
inclusive di-sparticle cross section (all possible pairings of sparticles) at leading order, without any 
additional hard element jets. These states were then decayed and hadronized using the standard Pythia 
6.4 implementation with MG5+aMC@NLO, and Delphes 3.0 was employed to simulate detector effects. 
The parameters niQ and mi /2 were scanned over the values m-o ~ 700 — 5700 GeV (step size 500 GeV) 
and mi /2 ~ 300 — 900 GeV (step size 100 GeV). 

The search was reproduced using the following seer_cuts.txt settings: 

Signal ## Do not alter this line! 

2 99 jet, pt 10 99999, eta -5 5 
Extra ## Do not alter this line! 
trigger jat pt 80 100 0 0 
##1GN0RES 

ignore ele with pt between 0 10 
ignore ele with eta over 2.47 
ignore muo with pt between 0 10 
ignore muo with eta over 2.4 
ignore gam with pt between 0 10 
ignore gam with eta over 2.5 
ignore bjt with pt between 0 20 
ignore bjt with eta over 2.5 
ignore Ijt with pt between 0 20 
ignore Ijt with eta over 4.5 
##CUTS 
## A-loose 

addcut all with jetA over 1 
## A-medium 

## addcut with jetA over 2 
## B-medium 

## addcut with jetB over 1 
## B-tight 

## addcut with jetB over 2 
## C-medium 

## addcut with jetC over 1 
## C-tight 

## addcut with jetC over 2 
## D 

## addcut with jetD over 1 
## E-loose 

## addcut with jetE over 1 
## E-medium 

## addcut with jetE over 2 
## E-tight 

## addcut with jetE over 3 

where the appropriate signal region (A through E) was uncommented separately for each run of Seer. 
All LHGO hies for each data point were included in seer_files.txt, resulting in a total of 10 runs of Seer, 
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Figure 5. Exclusion plot for inclusive jet+^T production in mSUGRA from Seer (orange at LO, 
black with K-factor), as compared to the observed limit from the originating ATLAS study (red) [20]. 


once for each signal region. Due to the large number of LHCO used, the plotting features were disabled 
in seer_plotS.txt, as distributions of events were irrelevant and only the overall summary provided in 
the text file output was needed. A script, analysis.cpp (included in the Results folder of Seer) was 
used to read through the output files and generate a Mathematica script to plot the exclusion contour. 

The results of the calculations using Seer are shown in Figure 5. The leading order cross sections 
agree with the values in the conference note to within about 20%, depending on the point in parameter 
space, but are systematically lower. Due to the large number of QCD events in the fully inclusive cross 
section, the effect of ISR jets and other NLO effects can be quite sizeable. Instead of incorporating 
them, these effects were accounted for using a K-factor of 35%. ^ The excellent agreement between Seer 
and the experimental results suggests that Seer is sufficiently capable of analyzing and reproducing 
the experimental study. 


5.2.2 ATLAS tri-lepton search 

The tri-lepton search that was reproduced [21] focused on a simplified version of the pMSSM, where 
the squarks, gluons and Higgsinos are decoupled. This leaves three remaining relevant parameters, 
Ml ^ M^o and M 2 ~ M^o ^ M~± which are varied, and a fixed tan/3 = 10. The process pp —)■ yf y® 
was generated with 50k events, and the same process of production, decay, hadronization and detector 
simulation was performed as discussed in the previous section. 

The seer_cuts.txt settings used to reproduce this study were: 


Signal ## Do not alter this 

3 99 lep, pt 10 99999, eta 

0 99 jet, pt 20 99999, eta 

0 99 gam, pt 10 99999, eta 


line! 
-2.5 2.5 
-2.5 2.5 
-2.5 2.5 


Extra ## Do not alter this line! 


##TH1S IS ATLAS TRIGGER 


trigger ele pt 25 0 0 0 
trigger ele pt 15 15 0 0 


^This value is chosen based on the K-factors listed on https: //twiki. cern. ch/twiki/bin/view/Sandbox/ 
TestTopic232 for the mSUGRA benchmark points. Ideally, the K-factor will vary based on the particular 
parameter points. A flat K-factor was chosen as a sufficient approximation to vet the Seer code. 
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trigger muo pt 
trigger muo pt 
trigger garni pt 
trigger garni pt 
trigger jet pt 
trigger jet pt 
trigger jet pt 
trigger jat pt 
trigger tat pt 
trigger eaim pt 
trigger eaim pt 
##IGNORES 


20 0 0 0 
10 10 0 0 
60 0 0 0 
20 20 0 0 
400 0 0 0 
165 165 165 0 
110 110 110 110 
70 70 0 0 
35 45 0 0 
20 10 0 0 
10 20 0 0 


ignore ele with pt between 0 10 
ignore ele with eta over 2.47 
ignore muo with pt between 0 10 
ignore muo with eta over 2.4 
ignore gam with pt between 0 10 
ignore gam with eta over 2.47 
ignore jet with pt between 0 20 
ignore jet with eta over 2.5 
addcut bjt with num over 0 
addcut all with a31p over 1 


where the numerical value of the addcut all with a31p over 1 line was varied from 1-6 for each 
signal region. All other aspects of the analysis were similar as to the 2-6 jets study discussed previously. 
The results of this are shown in Figure 6. 

In this case, the exclusion region found using Seer is in excellent agreement for m^o — m^o ^ mz- 
In the region where the spectrum compresses, multiple signal regions provide exclusions. This results 
in an enhanced exclusion using the full power of the CLg method, as discussed, but an inability to 
precisely reproduce the exclusions using Seer. This is a limitation that cannot be resolved currently 
without more information regarding the correlations of systematic uncertainties in the experimental 
signal regions. 

The original study employed the NLO cross sections from Prospino 2.0 [23], however the reproduc¬ 
tion in Seer uses LO cross sections. Appropriate K-factors in this region are approximately 10-20%, 
based on a randomized parameter point check in Prospino over the parameter space explored. An 
overall 15% K-factor has been included as a separate contour in Figure 6. Regions where the K-factor 
significantly increases the exclusion typically have high acceptance rates. Alternatively, where the 
K-factor exclusion line is close to the LO exclusion, the acceptance rates are low. 


5.2.3 ATLAS di-lepton search 

The di-lepton search that was reproduced [21] focused on a simplified version of the pMSSM, where 
right handed sleptons (degenerate en and (2^) decay directly to a lepton and a bino-type lightest 
neutralino. This results in a similar three parameter subset of the pMSSM, with a scan over the mass 
of the sleptons mi and the bino mass Mi ~ ^ fixed tan/3 = 10. The process pp —>■ was 

generated with 50k events using the same generation, decay, hadronization and detector simulation 
discussed previously. 
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Figure 6. Exclusion plot for chargino+neutralino production in the three lepton final state from Seer 
(orange at LO, black with K-factor), as compared to the observed limit from the originating ATLAS 
study (red) [21]. Lightest neutralino masses smaller than 10 GeV were not generated as part of this 
analysis. 


The seer_cuts.txt settings used to reproduce this study were: 


Signal ## Do not alter this line! 

2 2 lep, pt 10 99999, eta -2.5 2.5 
Extra ## Do not alter this line! 


##TH1S IS ATLAS TRIGGER 
trigger ele pt 25 0 0 0 
trigger ele pt 15 15 0 0 
trigger muo pt 20 0 0 0 
trigger muo pt 10 10 0 0 
trigger garni pt 60 0 0 0 
trigger garni pt 20 20 0 0 
trigger jet pt 400 000 
trigger jet pt 165 165 165 0 

trigger jet pt 110 110 110 110 

trigger jat pt 70 70 0 0 

trigger tat pt 35 45 0 0 

trigger eaim pt 20 10 0 0 

trigger eaim pt 10 20 0 0 

##1GN0RES 


ignore ele 
ignore muo 
ignore gam 
ignore gam 
ignore jet 
ignore jet 
addcut all 


with eta over 2.47 
with eta over 2.4 
with pt between 0 20 
with eta over 2.4 
with pt between 0 20 
with eta over 4.5 
with a21p over 2 
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m, (GeV) 

Figure 7. Exclusion plot for right handed slepton pair production from Seer (orange at LO, black with 
K-factor), as compared to the observed limit from the originating ATLAS study (red) [22]. Slepton 
masses smaller than 100 GeV were not generated as part of this analysis. 


where the numerical value of the addcut all with a21p over 1 line was varied from 1-5 for each 
signal region. All other aspects of the analysis were similar as to the 2-6 jets study discussed previously. 

Very similar exclusion regions were found, however for the region of large the Seer exclusion 
does not lie within the ±lcr boundary for the exclusion. This is due to the very low uncertainties in 
the signal, which results from the exclusive nature of the search (all events are vetoed for the presence 
of any jet with pj- > 20 GeV). One particular explanation for the difference in the exclusions is that 
Delphes employs a flat efficiency for lepton identification. This is one area where CheckMATE has 
improved upon the standard Delphes implementation available with the MG5-|-aMG@NLO package 
by incorporating many more details regarding lepton tagging and identification. None-the-less, Seer 
performs exceptionally well in reproducing this study, even with the standard Delphes treatment of 
leptons. 

A similar K-factor to the tri-lepton study was included in this analysis, to show the dependence 
on the signal cross section. Appropriate K-factors found using Prospino are on the order of 10-20%, 
however the jet veto employed in this analysis would suggest that an appropriate K-factor is more 
challenging to determine. Additionally, and since the K-factor is parameter point dependent, this 
value is included purely as a qualitative descriptor of the effect of increasing the signal cross section 
on the exclusion results. 

6 Summary 

Seer is a useful tool for analyzing LHCO output files for collider processes that have gone through a 
detector simulation like PGS or Delphes. With a simple structure to use, and an easy-to-modify script 
system for the implementation of new cuts and kinematic variables. Seer can be rapidly employed for 
exploring kinematic distributions as well as an analysis of trigger response, cut efficiencies, and even 
full analyses of experimental processes. 

There are a number of primary uses for this tool. The most obvious would be in visualizing 
kinematic distributions and exploring the kinematics of a signal. The second would be in reproducing 
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experimental analyses. The third is in testing new kinematic variables and distributions. For example, 
stransverse variables are of particular interest for SUSY searches, and new variants are still being 
developed. The Seer code structure provides a simple framework that allows for rapid implementation 
of these new test variables. 

Alternative code packages such as MadAnalysis 5 and CheckMATE include different features and 
different interfaces, with different learning curves. Seer presents an easily usable, easily modifiable, 
powerful and effective way to analyze the results of simulation data. 
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